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Computer-assisted  Instruction  (CAI)  has  traditionally  used  the  visual  display  terminal  (VOT) 
as  the  medium  to  convey  Information  to  the  subject.  The  present  experiment  explored  the 
possibility  of  using  computer-generated  speech  (CGS)  with,  and  without,  the  VDT  to  convey 
Information  to  the  subject.  The  two  CGS  conditions  were  contrasted  with  more  traditional 
teaching  methods  (l.e.,  human  narrative,  and  narrative  with  a  script).  Using  the  four  different 
methods  described  above,  a  total  of  160  subjects  were  presented  Identical  Instructions  to  a 
cognitive  task.  Significant  differences  existed  on  task  restarts  and  persistence  among  the 
conditions.  Significant  gender  differences  also  occurred  In  the  time  taken  to  make  the  Initial 
move  on  the  cognitive  task.  The  results  suggest  that  CGS  can  be  a  viable  alternative  to 
traditional  teaching  methods,  and  the  C6S  has  potential  to  be  Incorporated  as  another  dimension 
of  CAI. 
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EDUCATIONAL  OPPORTUNITIES  ASSOCIATED  WITH 
COMPUTER-ASSISTED  INSTRUCTION  AND 
COMPUTER-GENERATED  SPEECH 


I.  INTRODUCTION 

Man y  articles  have  been  written  that  suggest  computers  are  likely  to  have  a  major  impact  on 
society  In  general  and  on  the  behavioral  sciences  In  particular  (Bartram  A  Bayllss,  1984;  Bennett 
A  Maher,  1984;  Brady  A  Hill,  1984;  Mathews,  1980;  Rind  A  Solity,  1984;  Szyperskl,  Grochla, 
Richter,  A  Weltz,  1983;  Zajonc,  1984).  Behavioral  scientists  are  expected  to  play  an 
ever-increasing  role  In  proactive  evaluations  of  how  new  technology  Inpacts  on  hunan  behavior 
(Elworfc  A  Gutkln,  1985).  This  proactive  concept  can  be  traced  back  to  the  early  senlnal  work  of 
conputer-asslsted  Instruction  (CAI)  which  began  In  1926,  when  Pressey  designed  a  mechanical 
device  that  could  automatically  provide  drill  and  practice  Items  to  students  (Pressey,  1926). 

The  early  contribution  Pressey  made  by  Integrating  the  notions  of  machines  and  learning,  and 
his  Introduction  of  a  mastery  paradigm  Into  his  "teaching  machines,"  are  Important  concepts 
(Pagllaro,  1983).  Pressey  used  state-of-the-art  equipment,  during  his  era,  to  construct  the 
forerunner  of  CAI  as  we  know  It  today.  The  present  experiment  also  uses  state-of-the-art 

equipment  to  explore  the  logical  use  of  computer-generated  speech  (CGS),  with  the  microcomputer, 
to  Investigate  Its  potential  use  within  the  realm  of  CAI. 

A  discussion  of  the  evolution  of  CAI  or  teaching  machines  would  be  Incomplete  without 

mentioning  Skinner's  IBM  Teaching  Machine  Project  from  the  1950's,  which  set  the  stage  for 

advances  that  were  made  In  the  1960's,  (e.g.,  IBM  1500  Instructional  System  and  PLATO)  (Rath, 
Anderson,  A  Bralnerd,  1959;  Skinner,  1958;  Suppes  A  Mornlngstar,  1969).  The  1970's  saw  further 
gains  In  both  CAI  theory  (e.g.,  TICCIT)  and  In  hardware,  as  new  state-of-the-art  microcomputer 
equipment  was  developed. 

In  the  recent  past,  CAI  has  been  largely  restricted  to  visual-only  presentations  using  the 
visual  display  terminal  (VOT).  However,  since  the  evolution  of  CGS  into  a  high-quality, 

affordable  product.  It  Is  logical  to  examine  how  CGS  could  be  adapted  for  CAI  or  other  training 
purposes.  If  CGS  was  found  to  be  as  understandable  as  the  more  traditional  methods  of  conveying 
knowledge  (l.e.,  human  voice  and/or  human  voice  plus  script),  then  considerable  cost  savings 
could  be  realized  In  the  education  and  training  coamiunltles  of  Industry,  academia,  and  the 
military. 

Considering  the  multlmllllons  of  dollars  spent  annually  on  training  costs  In  an  attempt  to 
convey  knowledge  In  an  accurate  manner,  any  cost  saving  can  have  major  economic  implications  for 
financial  planners.  This  economic  concept  becomes  even  more  dramatic  when  the  portability  of 
microcomputers  is  coupled  with  CGS.  This  logical  marriage  of  technology  results  in  a  unique, 
exportable  education  and  training  package  that  provides  additional  rel nf orcement  to  the  trainee 
by  using  another  of  the  body's  senses  (l.e.,  hearing)  to  augment  the  three  modes  of  the  CAI 
Instructional  paradigm:  rule,  example,  and  practice. 

CGS  Is  rapidly  finding  more  uses  In  society  as  computer  technology  continues  to  make  people's 
lives  more  efficient.  CGS  Is  currently  being  used  In  numerous  cost-saving  ways  throughout  the 
United  States  and  In  other  modern  societies.  For  example,  the  banking  Industry  is  using 
computer-generated  speech  for  their  automated  banking  tellers,  which  were  nonexistent  only  5 
years  ago.  Supermarket  registers  are  using  CGS  to  “electronically  verbalize"  the  Item  prices  to 
their  customers.  "Electroverbal Izatl on"  Is  being  used  In  shopping  malls  at  the  computerized 
Information/directory  stations.  Airport  terminals  and  subways  are  also  using  CGS  for  their 
passengers'  convenience  and  safety  by  Informing  travelers  of  departure  and  arrival  information, 
and  when  shuttle  bus  doors  will  be  closing. 


These  examples  demonstrate  the  current  uses  of  CGS  to  convey  accurate  and  timely  Information 
to  people  In  their  dally  transactions,  while  simultaneously  reducing  the  workload  of  service 
providers  (e.g.,  businessmen  and  merchants).  It  appears  that  CGS  will  continue  to  Influence 
society  In  ever-increasing  ways  -  from  recreation  (childrens  toys),  and  dally  business 
transactions,  to  Innovative  uses  for  the  handicapped. 

One  fundamental,  often-unstated  premise  In  any  learning  or  training  scenario  Is  whether  or 
not  the  student  comprehends  the  Information  or  Instructions  that  have  been  transmitted.  If 
different  techniques  or  methods  of  presenting  the  desired  Information  would  result  In  ?  more 
rapid  comprehension  of  the  material,  then  training  cost  savings  (time  and  money)  could  be 
realized.  Furthermore,  If  one  of  these  different  methods  of  Instructional  presentation  used  CGS, 
and  resulted  In  similar  (or  even  better)  student  comprehension  than  does  the  traditional  teacher- 
student  Interaction,  personnel  costs  associated  with  training  could  also  be  reduced.  A  proactive 
behavioral  science  question  arises:  How  can  computer-generated  speech  best  be  used  to  assist  In 
education  and  training  domains?  In  order  to  Investigate  differences  In  performance  associated 
with  different  methods  of  Instructional  presentation  (l.e.,  computer-generated  speech  versus 
human  voice),  the  following  experiment  was  conducted. 


II.  METHOD 


Subjects 


One  hundred  sixty  male  and  female  subjects  were  presented  with  four  different  experimental 
manipulations.  Male  and  female  subjects  were  equitably  distributed  across  each  of  the  four 
experimental  conditions.  All  subjects  were  randomly  selected  from  the  Air  Force  Basic  Military 
Training  School  at  Lackland  AFB,  Texas,  and  were  randomly  assigned  to  one  of  the  four 
experimental  conditions. 


Manipulation  Check 

A  check  was  made  to  ensure  that  the  four  experimental  manipulations  would  not  be  compromised 
by  subjects'  having  prior  knowledge  of  the  conceptual  task.  As  the  subject  was  seated,  the 
experimenter  asked  the  subject  If  he/she  was  familiar  with  the  "Pyramid  Task"  or  (as  It  Is  also 
referred  to  In  the  literature,  "The  Towers  of  Hanoi*  Game)  (Baum,  Shaeffer,  Lake,  Fleming,  A 
Collins,  1985).  The  two  subjects  that  were  familiar  with  the  task  were  thanked  for  their 
willingness  to  participate  and  excused  from  testing. 


Procedure 


The  subjects  were  assembled  In  one  building  and  Informed  consent  was  obtained.  The  test 
administrator.  In  turn,  escorted  each  subject  to  an  adjacent  building  where  specific  details  of 
the  project  were  explained  to  each  Individually.  Upon  arrival  In  the  testing  room,  subjects  were 
seated  In  front  of  the  Pyramid  Task,  and  a  Koss  K/6X  stereophonic  headset  was  placed  over  their 
ears.  All  further  Instructions  were  received  through  the  headset  to  ensure  an  analogous 
environment  existed  across  all  four  conditions.  Each  of  the  four  experimental  conditions  used 
different  methods  to  present  the  following  Pyramid  Task  Instructions  to  the  subjects: 


In  front  of  you  Is  a  Pyramid  Task.  This  task  consists  of  a  wooden  board  with  three 
pegs  placed  In  a  row.  The  object  of  this  task  Is  to  move  all  seven  blocks  of  wood  from 
the  peg  on  one  side  of  the  board,  to  the  peg  on  the  other  side  of  the  board.  However, 


and  this  Is  Important,  you  may  never  place  a  larger  block  on  top  of  a  smaller  block  as 
you  attest  to  wove  all  seven  blocks  from  one  side  of  the  board  to  the  other.  You  say 
restart  froa  the  beginning  as  often  as  you  wish.  You  aay  stop  this  task  at  any  tlae  by 
telling  the  experlaenter  you  wish  to  stop.  There  Is  no  penalty  for  stopping  or  for 
starting  over.  Once  again,  you  nay  only  place  a  saaller  block  on  a  larger  block.  Good 
luck.  Are  there  any  questions? 

The  first  experlaental  condition  consisted  of  the  experlaenter  reading  the  Pyraald  Task 
Instructions  to  the  subject.  Following  this  Instructional  presentation  and  after  the  subject 
Indicated  that  he/she  understood  the  Instructions,  the  experlaenter  Instructed  the  subject  to 
begin. 

The  second  experlaental  condition  followed  the  saae  experlaental  procedure  of  the  previous 
condition,  but  eaployed  one  subtle  change:  While  the  experlaenter  read  the  Instructions  aloud  to 
the  subject,  the  subject  followed  along  using  his  or  her  typed  copy  of  the  Instructions.  The 
Instructions  were  typed  on  white  bond  typing  paper  (8  1/2  X  11  Inches).  All  other  aspects  of  the 
study  reaalned  the  saae. 

Thus,  In  the  first  condition,  the  experlaenter  aerely  read  the  typed  Instructions  to  the 
subject,  whereas,  In  the  second  condition,  the  Instructions  were  read  by  the  experlaenter  to  the 
subject,  while  the  subject  followed  along  on  the  typed  copy  of  the  Instructions  placed  In  front 
of  hla/her. 

The  third  condition  was  conceptually  and  procedurally  analogous  to  the  first  condition  except 
that  Instead  of  using  the  experimenter's  voice,  the  subject  was  "electrophonlcally'1  presented 
with  the  saae  task  Instructions  using  CGS.  The  CGS  used  In  this  project  was  electrophonlcally 
presented  using  Digital  Equipment  Corporation's  DECtalk,  which  produces  unusually  high-quality 
speech  by  using  a  unique  three-level  processing  approach  that  avoids  the  monotone,  robot-like 
voices  of  lower-quality  systems. 

The  fourth  condition  was  procedurally  and  conceptually  similar  to  the  third  condition  except 
that  an  11 -Inch  monochroae  VDT  was  provided  for  the  subject  to  follow  along  as  the  task 

Instructions  were  presented  to  the  subject  using  CGS.  An  Alcyon  microcomputer  was  used  to 

generate  black  letters  against  a  white  background  on  the  480-  by  530-pixel  VDT. 

Thus,  the  same  Instructions  were  presented  using  four  different  methods  of  presentation:  (a) 
human  voice,  (b)  voice  plus  paper  script,  (c)  computer-generated  speech,  and  (d)  CGS  combined 
with  VDT  script.  Consequently,  the  first  and  third  conditions  are  conceptually  similar,  as  are 
the  second  and  fourth  conditions.  At  the  conclusion  of  the  experiment,  each  subject  was 

debriefed  and  escorted  to  a  casual  area  to  prevent  contact  with  other  subjects  awaiting  the 

experiment. 


III.  RESULTS 


Dependent  Variables 

The  dependent  variables  In  this  project  were  the  number  of  times  the  subject  started  over, 
the  persistence  exhibited  by  the  subjects  on  the  task,  and  gender  differences  associated  with  the 
time  taken  to  make  the  Initial  move. 

A  one-way  analysis  of  variance  (ANOVA)  was  conducted  to  Investigate  differences  between  the 
four  conditions  In  the  number  of  times  the  subjects  started  over  on  the  pyramid  task.  The  ANOVA 
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revealed  that  the  subjects  In  the  second  condition  (l.e.,  paper  script  plus  experl center's  voice) 
had  significantly  sore  restarts  (M  -  1.18)  than  did  the  subjects  In  the  voice-only  condition  (M  * 
0.63),  the  CSS  condition  (M  «  0.78),  or  the  CSS  with  VDT  condition  (M  -  0.63),  F_  (3,  156)  -  2.84, 
p  ^  .05. 

An  additional  analysis  was  performed  to  determine  If  any  differences  In  persistence  existed 
across  the  experimental  conditions.  A  lack  of  persistence  was  measured  by  counting  the  number  of 
subjects  who  quit  working  on  the  task  before  their  time  limit  had  expired.  A  complex  Chi-Square 
analysis  revealed  significant  differences  between  conditions  (x2  (3,  N  *  47)  ■  8.14,  £  <  .05), 
with  more  people  quitting  when  they  were  In  the  voice  plus  paper  script  (n  *  19)  or  CGS  plus  VDT 
(n  »  18)  conditions  than  when  they  were  presented  the  Instructions  using  only  voice  (n  *  6)  or 
only  CGS  (n  -  4). 

A  final  analysis  was  performed  to  Investigate  for  gender  differences  In  the  time  It  took  to 
actually  perform  the  first  move.  Collapsing  across  all  four  conditions  revealed  that  a 
significant  difference  existed  between  genders  (with  males  moving  sooner  (M  *  9.52  seconds)  than 
females  (M  «  15.05  seconds),  £(149)  ■  12.75,  £  <  .001).  Those  subjects  who  made  their  Initial 
move  before  the  Instructions  were  completed  were  eliminated  from  the  analysis  and  account  for  the 
different  degrees  of  freedom  between  this  t-test  and  the  ANOVA  above. 


Discussion 


A  reason  for  these  findings  could  be  that  a  considerable  amount  of  Information  Is  conveyed 
without  the  added  requirement  of  attending  to  a  script  or  a  VDT  during  Its  presentation.  When 
this  dual  requirement  for  one's  attention  exists.  It  may  be  that  certain  aspects  of  one's  social 
background  cause  timesharing  of  one's  concentration  between  the  visual  and  hearing  senses.  This 
sharing  of  attention  may  contribute  to  a  deficit  In  concentration,  or  performance,  and  could 
explain  the  lack  of  persistence  in  the  Instructional  conditions  that  used  both  an  aural  and  a 
visual  method  of  presentation. 

It  may  be  that  the  greater  number  of  restarts  In  the  paper  script  plus  human  voice  condition 
could  be  explained  by:  the  subjects'  having  a  social  desire  to  make  eye  contact  with  the 
experimenter  (who  was  verbalizing  the  script  to  them),  plus  also  dividing  their  concentration 
between  the  experimenter's  voice  and  the  script  in  front  of  them.  This  timesharing  may  have 
resulted  In  either  not  understanding  the  Instructions  as  well  as  their  peers  did  in  the  other 
conditions,  or  In  not  being  able  to  begin  formulating  a  strategy  while  the  Instructions  were 
being  presented,  or  prior  to  beginning  the  task. 

Implications  exist  for  any  test  that  uses  a  test  proctor  to  read  the  Instructions  to  the  test 
taker,  numerous  examples  of  this  testing  procedure  (l.e.,  where  the  proctor  reads  Instructions 
to  the  test  taker  while  he/she  follows  along)  exist  in  both  commercial  and  military  aptitude 
tests.  The  evidence  obtained  during  this  project  suggests  that  this  procedure  may  not  be  needed, 
and  In  fact,  could  be  detrimental  to  individual  performance  during  the  test. 

As  the  popularity  of  computers  Increases,  the  gender  differences  noted  In  this  project  may 
decrease  over  time.  However,  until  computer  familiarity  across  genders  and  different 
socioeconomic  groups  Is  established  and  computer  fear  Is  eliminated,  the  societal  Implications 
associated  with  computer-based  testing  and  education  remain  largely  unknown.  The  challenge  that 
faces  behavioral  scientists  Is  to  pursue  proactively  and  shape  equitably,  when  possible,  the 
questions  and  answers  to  this  burgeoning  new  field  of  technology  between  man  and  machine. 
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